BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data